Evolution of cis-regulatory sequences in Drosophila: a systematic approach
نویسندگان
چکیده
Numerous tools have been developed to align genomic sequences. However, their relative performance in specific applications remains poorly characterized. Alignments of protein-coding sequences typically have been benchmarked against "correct" alignments inferred from structural data. For noncoding sequences, where such independent validation is lacking, simulation provides an effective means to generate "correct" alignments with which to benchmark alignment tools. Using rates of non-coding sequence evolution estimated from the genus Drosophila, I simulated alignments over a range of divergence times under varying models incorporating point substitution, insertion/deletion events, and short blocks of constrained sequences such as those found in cis-regulatory regions. I then compared "correct" alignments generated by a modified version of the ROSE simulation platform to alignments of the simulated derived sequences produced by eight pairwise alignment tools (Avid, BlastZ, Chaos, ClustalW, DiAlign, Lagan, Needle, and WABA) to determine the off-the-shelf performance of each tool. As expected, the ability to align noncoding sequences accurately decreases with increasing divergence for all tools, 13 and declines faster in the presence of insertion/deletion evolution. Global alignment tools (Avid, ClustalW, Lagan, and Needle) typically have higher sensitivity over entire non-coding sequences as well as in constrained sequences. Local tools (BlastZ, Chaos, and WABA) have lower overall sensitivity as a consequence of incomplete coverage, but have high specificity to detect constrained sequences as well as high sensitivity within the subset of sequences they align. Tools such as DiAlign, which generate both local and global outputs, produce alignments of constrained sequences with both high sensitivity and specificity for divergence distances in the range of 1.25–3.0 substitutions per site. For species with genomic properties similar to Drosophila, I conclude that a single pair of optimally diverged species analyzed with a high performance alignment tool can yield accurate and specific alignments of functionally constrained non-coding sequences. Further algorithm development, optimization of alignment parameters, and benchmarking studies will be necessary to extract the maximal biological information from alignments of functional non-coding DNA.
منابع مشابه
Nomadic Enhancers: Tissue-Specific cis-Regulatory Elements of yellow Have Divergent Genomic Positions among Drosophila Species
cis-regulatory DNA sequences known as enhancers control gene expression in space and time. They are central to metazoan development and are often responsible for changes in gene regulation that contribute to phenotypic evolution. Here, we examine the sequence, function, and genomic location of enhancers controlling tissue- and cell-type specific expression of the yellow gene in six Drosophila s...
متن کاملDirect regulation of knot gene expression by Ultrabithorax and the evolution of cis-regulatory elements in Drosophila.
The regulation of development by Hox proteins is important in the evolution of animal morphology, but how the regulatory sequences of Hox-regulated target genes function and evolve is unclear. To understand the regulatory organization and evolution of a Hox target gene, we have identified a wing-specific cis-regulatory element controlling the knot gene, which is expressed in the developing Dros...
متن کاملGenomic inferences of the cis-regulatory nucleotide polymorphisms underlying gene expression differences between Drosophila melanogaster mating races.
Nucleotide sequence polymorphisms affecting gene expression occur in the regulatory region of genes (in cis) and elsewhere in the genome (in trans). Further study is required to weigh the relative importance of cis- and trans-acting mutations in mediating gene expression differences within and between species. Here, microarray hybridization experiments were used to isolate 363 gene expression d...
متن کاملMORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules
The discovery and analysis of cis-regulatory modules (CRMs) in metazoan genomes is crucial for understanding the transcriptional control of development and many other biological processes. Cross-species sequence comparison holds much promise for improving computational prediction of CRMs, for elucidating their binding site composition, and for understanding how they evolve. Current methods for ...
متن کاملThe structure and evolution of cis-regulatory regions: the shavenbaby story.
In this paper, we provide a historical account of the contribution of a single line of research to our current understanding of the structure of cis-regulatory regions and the genetic basis for morphological evolution. We revisit the experiments that shed light on the evolution of larval cuticular patterns within the genus Drosophila and the evolution and structure of the shavenbaby gene. We de...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007